STDP with synaptic fatigue for learning of spike-time-coded patterns in the presence of parallel rate-coding

ABSTRACT

A circuit implementing a spiking neural network that includes a learning component that can learn from temporal correlations in the spikes regardless of correlations in the rates. In some embodiments, the learning component comprises a rate-discounting component. In some embodiments, the learning rule computes a rate-normalized covariance (normcov) matrix, detects clusters in this matrix, and sets the synaptic weights according to these clusters. In some embodiments, a synapse with a long-term plasticity rule has an efficacy that is composed by a weight and a fatiguing component. In some embodiments, A Hebbian plasticity component modifies the weight component and a short-term fatigue plasticity component modifies the fatiguing component. The fatigue component increases with increases in the presynaptic spike rate. In some embodiments, the fatigue component increases are implemented in a spike-based manner. In some embodiments, the Hebbian plasticity is a spike-timing-dependent plasticity (STDP), resulting in a fatiguing STDP (FSTDP) synapse.

BACKGROUND

The present invention relates to the technical field of artificialintelligence. In particular, the present invention relates tospike-timing-dependent plasticity (STDP) in neurons.

Neurons are electrically excitable cells that process and transmitinformation around the nervous system. Neurons are a primary componentof the brain and spinal cord in vertebrates, and are a primary componentof the ventral nerve cord in invertebrates. These neurons include anucleus that is comprised of a soma, an axon, and a dendrite. The somaacts as a processing unit for neuronal signals and is responsible forgenerating action potentials (i.e., electrical signals or potentials).The action potentials are propagated from the soma, through an axon, tothe end of the neuron, or axon terminal. In the axon terminal, chemicalneurotransmitters that encode the electrical signal are produced. Thesechemical neurotransmitters cross a gap between the axon terminal and adendrite of another neuron. This gap is part of the connection system oftwo neurons and is referred to as a synapse. Synapses can becharacterized by their synaptic plasticity. Synaptic plasticity is theability of a synapse to strengthen or weaken over time in response toinputs.

Neural networks are a computational model used in artificialintelligence systems. Neural networks are based on multiple artificialplastic neurons, which are analogous to the axons in the human brain.Each artificial neuron is connected with many others, and links canenhance or inhibit the activation state of adjoining neural units.Artificial neurons, in some embodiments, compute using summationfunctions, and there may be a threshold function or limiting function oneach connection and on the artificial neuron itself, such that thesignal must surpass the limit before propagating to other artificialneurons. This thresholding is a key component of the spiking neuralnetwork (SNNs).

SNNs are algorithms that underlie the brain's function, and theirimplementation as a collection of artificial neurons. Many SNNs areloosely based on biological neurons, and the action potentials ofbiological neurons are modeled as spikes. These SNNs learn from inputdata whose features are encoded as streams of spikes. These networkshave applications in unsupervised learning through detectingcorrelations across the features of the input. One aspect of theartificial neuron that leads to learning correlation is the Hebbianplasticity, i.e. a class of synaptic plasticity rules that modifysynapses according to pre- and post-synaptic activity.Spike-timing-dependent plasticity (STDP) is a Hebbian synapticplasticity rule that can allow an SNN to learn from correlations in thetiming of individual spikes across input features, by detecting clustersin the covariance matrix of the inputs. These networks can use thetiming of the spikes to learn.

However, STDP is rate-sensitive, meaning that in conventional neuralnetworks, STDP learns from correlations in rate. If, in addition tocorrelations of individual spike timings there are correlations in theinput rates, which are not relevant to the task, STDP may learn from therates and not from the timings. Thus, for tasks where correlations intiming but not rate are wanted, neurons with only aspike-timing-dependent plasticity often fail.

SUMMARY

In a first aspect of the present invention, an apparatus includes: atleast one presynaptic artificial neuron generating a sequence ofpresynaptic spikes having a timing, a postsynaptic artificial neuroncomprising a membrane potential, a learning rule component comprising asynaptic efficacy and a synaptic plasticity, and the learning ruleconfigured to modify the synaptic efficacy by a learning rule. Thelearning rule is based on the timing and discounts presynaptic spikerates.

In a second aspect of the present invention, an apparatus includes: atleast one presynaptic artificial neuron generating a sequence ofpresynaptic spikes, a postsynaptic artificial neuron including apostsynaptic membrane potential and at least one fatiguing plasticsynapse connected to the postsynaptic membrane potential. Thepostsynaptic artificial neuron configured to receive the set of spikes.The at least one fatiguing plastic synapses includes an efficacy havinga spike-timing-dependent plasticity and a fatiguing component. Thefatiguing component reduces the efficacy based on the set of inputspikes.

In a third aspect of the present invention, an apparatus includes: atleast one presynaptic artificial neuron, configured to generate asequence of presynaptic spikes, a postsynaptic artificial neuronincluding a postsynaptic membrane potential and at least one plasticsynapse configured to receive the set of spikes and to modify thepostsynaptic membrane potential. The at least one plastic synapse learnsbased on a normalized covariance of the sequence of presynaptic spikes.

BRIEF DESCRIPTION OF THE DRAWINGS

So that the manner in which the above recited features of the presentinvention can be understood in detail, a more particular description ofthe present invention, briefly summarized above, may be had by referenceto embodiments, some of which are illustrated in the appended drawings.It is to be noted, however, that the appended drawings illustrate onlytypical embodiments of the present invention and the present inventionmay admit to other equally effective embodiments.

FIG. 1A illustrates a circuit implementing a neural network with alearning rule with a rate-discounting element, according to anembodiment of the present invention.

FIG. 1B illustrates a circuit implementing a normcov-based learningneural network, according to an embodiment of the present invention.

FIG. 1C illustrates a circuit implementing an FSTDP synapse, accordingto an embodiment of the present invention.

FIG. 2 illustrates a device having an integrate-and-figure neuron with aset of FSTDP synapses attached.

Other features of the present embodiments will be apparent from theDetailed Description that follows.

DETAILED DESCRIPTION

In the following detailed description of the preferred embodiments,reference is made to the accompanying drawings, which form a parthereof, and within which are shown by way of illustration specificembodiments by which the present invention may be practiced. It is to beunderstood that other embodiments may be utilized and structural changesmay be made without departing from the scope of the present invention.Electrical, mechanical, logical and structural changes may be made tothe embodiments without departing from the spirit and scope of thepresent teachings. The following detailed description is therefore notto be taken in a limiting sense, and the scope of the present disclosureis defined by the appended claims and their equivalents.

The methods and systems disclosed herein may be implemented in any meansfor achieving various aspects, and may be executed in a form of amachine-readable medium embodying a set of instructions that, whenexecuted by a machine, cause the machine to perform any of theoperations disclosed herein. These and other features, aspects, andadvantages of the present invention will become readily apparent tothose skilled in the art and understood with reference to the followingdescription, appended claims, and accompanying figures, the presentinvention not being limited to any particular disclosed embodiment(s).

Described herein is an artificial neuronal circuit that learns fromtemporal correlations in the timings of input spikes even in thepresence of additional correlations in the spike rates. The learningrule can be based on a normalized covariance (normcov) matrix of theinputs. A synaptic plasticity rule is also described, which, in aspects,comprises a rate-normalizing component in the plastic synapses, and aHebbian or anti-Hebbian plasticity component. The synaptic plasticityrule can be implemented by a spike-timing-dependent plasticity rule thatcomprises a “synaptic fatigue” type of plasticity. This synapticplasticity rule can implement the normcov-based learning

To learn from temporal correlations regardless of the spike rate, aneural network that encompasses an element that discounts the influenceof spike rates in the learning is introduced.

In one implementation of this learning rule, synapses are modifiedaccording to the normalized covariance matrix. The normalized covariancedefined as:

${{{norm}\mspace{14mu}{{cov}\left( {X_{i},X_{j}} \right)}} = {E\left\lbrack {\frac{X_{i}}{E\left\lbrack X_{i} \right\rbrack} \cdot \frac{X_{j}}{E\left\lbrack X_{j} \right\rbrack}} \right\rbrack}},$where X_(i) and X_(j) are two input streams. Synapses that correspond tothe cluster of strongly covarying inputs are potentiated according tonormcov.

Additionally, a Hebbian synaptic plasticity rule that comprises arate-normalizing component is described. Rate normalization in thiscontext means that the normalization of postsynaptic effects ofpresynaptic spikes is according to the presynaptic spike rate. Therate-normalizing component can be accommodated by synaptic fatigue.Synaptic fatigue in biology is the temporary reduction in the efficacyof a synapse for a time window immediately following an event which wasinput to the synapse. This reduces the probability that high frequencystreams will cause the post-synaptic neuron to spike, and leavessynapses that receive low frequency streams relatively uninfluenced.

Spike-timing-dependent plasticity potentiates synapses that receiveinput which is causally linked to the post-synaptic spikes. Thus, afatiguing synapse enables the detector to properly function even whenthe correlated input streams are of relatively low-frequency, byeliminating those causal links between pre-synaptic and post-synapticevents that would be merely due to the high pre-synaptic frequency ofuncorrelated synapses, without influencing the synapses that receivelow-frequency but correlated event streams.

In an embodiment, an apparatus comprises a presynaptic artificial neurongenerating a first set of spikes, a postsynaptic artificial neuroncomprising a postsynaptic membrane potential and generating a second setof spikes, and a fatiguing plastic synapse operatively connected to thepresynaptic artificial neuron to receive the first set of spikes andconfigured to modify the postsynaptic membrane potential based on afatiguing component and a spike-timing-dependent plasticity.

In a preferred embodiment, the fatiguing plastic synapse comprises aspike-timing-dependent plasticity. In a further preferred embodiment,the first set of spikes is characterized by an input rate and thefatiguing component is configured to reduce the postsynaptic membranepotential when the input rate increases. In an optional embodiment, thefatiguing component comprises a fatiguing rule:(t)=1−1/(R(t)+1),wherein R(t) is a rate component.

In an advantageous embodiment, the rate component is a mean over a timewindow. In an optional embodiment, the fatiguing component comprises afatiguing rule:F(t)=ƒ(t−t ^(sp) ^(n) )+F(t ^(sp) ^(n) ),where t^(sp) ^(n) is the timing of the last presynaptic spike prior tot.

In other optional embodiments:

${f(x)} = \left\{ {\begin{matrix}{a,} & {0 < x < x_{window}} \\{0,} & {x \geq x_{window}}\end{matrix},{a \in \left( {0,1} \right\rbrack},{{f(x)} = {1 - {ax}}},{a > 0},{{{or}{f(x)}} = {1 - {ae}^{- x}}},{a > 0.}} \right.$

In an alternative embodiment, the fatiguing plastic synapse comprises atransistor having a gate input, wherein the set of presynaptic spikesare input into the gate so as to modify the amount of current passingthrough the transistor. In a preferred embodiment, modifying the amountof current passing through the transistor is in accordance with afatigue rule.

Numerous other embodiments are described throughout herein. All of theseembodiments are intended to be within the scope of the present inventionherein disclosed. Although various embodiments are described herein, itis to be understood that not necessarily all objects, advantages,features or concepts need to be achieved in accordance with anyparticular embodiment. Thus, for example, those skilled in the art willrecognize that the present invention may be embodied or carried out in amanner that achieves or optimizes one advantage or group of advantagesas taught or suggested herein without necessarily achieving otherobjects or advantages as may be taught or suggested herein.

In a set of event streams, the timing of events may be correlated acrosscertain streams. Detection of the subset of correlated streams hasnumerous applications. An artificial spiking neuron's synapses equippedwith STDP can be used to detect this correlation if the streams are ofroughly equal and constant mean frequencies. This detection is possiblebecause synapses receiving correlated streams of events tend to bepotentiated, i.e. strengthened, while synapses receiving uncorrelatedstreams of events tend to be depressed, i.e. weakened. Thus correlationdetection is done through synaptic strength. However, if someuncorrelated streams are of high mean frequency relative to thecorrelated ones, this neurosynaptic correlation detector can yield falseresults, as synapses receiving high input frequencies are more likely tobe potentiated than synapses with low input frequency.

STDP refers to a neuronal characteristic for which the efficacy ofsynaptic transmission is influenced by a relative timing betweenpresynaptic and postsynaptic stimulation. That is, the efficacy isaffected by the timing of a presynaptic action potential and apostsynaptic potential. STDP has also become a principal experimentalprotocol for inducing change in synaptic plasticity.

The strength of a synapse is related to a magnitude of a change inpostsynaptic current. Changes in synaptic strength can be short term(short term potentiation/depression, or STP/STD), which causes nopermanent structural changes in the neuron's synapse. Typically, thischange lasts a few seconds to minutes. Alternatively, changes insynaptic strength can also be long term (long termpotentiation/depression, or LTP/LTD). LTP can cause permanent ofsemi-permanent changes of the synaptic weight, and are typicallydependent on the relative timing of pre- and post-synaptic spikes(Δt=t_(pre)−t_(post)). For these long-term changes, repeated orcontinuous synaptic activation results in an alteration of the structureof the synapse itself. Learning and memory are believed to result fromlong-term changes in synaptic strength. A neuron equipped with STDPsynapses can learn to recognize consistent spatiotemporal spike patterns(e.g. Song et al., 2000; Gutig et al., 2003; Masquelier et al., 2008).

Synaptic fatigue is a form of short-term plasticity (STP). Changessynaptic efficacy transiently. STP is observed in biological synapses asdepletion of the neurotransmitter in the pre-synaptic terminal by eachtransmitted spike, and subsequent gradual replenishment. STDP can learnspike timing patterns, but not in the presence of rate coding. Fatiguingspike-timing-dependent plasticity (FSTDP) is a long-term synapticplasticity rule which employs short-term synaptic fatigue and long-termSTDP to learn precisely-timed spike patterns, discounting the influenceof pre-synaptic spike rates on learning.

The shapes of action potentials at synapses are carefully modeled by thevoltage responses propagated through axon and dendrite. Synapticplasticity, the ability of connections between neurons to change instrength, can be viewed as an underlying mechanism of learning andmemory. Spike-timing-dependent plasticity refers to the phenomenon, inwhich the synaptic weight can either be long term potentiated (LTP) ordepressed (LTD), depending on the relative timing between presynapticand postsynaptic spikes.

The concept of synaptic strength leads to the concept of a strongsynapse as opposed to a weak synapse. For a strong synapse, an actionpotential in the presynaptic neuron triggers another action potential inthe postsynaptic neuron. Conversely, for a weak synapse, an actionpotential in the presynaptic neuron may not trigger another actionpotential in the postsynaptic neuron. In other words, for a weaksynapse, an excitatory postsynaptic potential in the postsynaptic neuronmay not reach the threshold for action potential initiation in thepostsynaptic neuron.

Thus, synaptic plasticity refers to an ability of the synaptic strengthbetween two neurons to change. The above-described synaptic plasticityrefers to and results in long term changes in the synaptic strength of asynapse, either long term potentiation (LTP) or long term depression(LTD).

However, challenges arise in cases of significant rate variability.Timing of events recorded by sensors is rich in information, but it isoften combined with differences in the events' rate over time or acrosssensors. Research has shown that rate dominates STDP and thatspike-time-coding is equivalent to rate-coding for slow modulation ofrate (Kempter et al., 1999). Furthermore, STDP allows fastrate-modulated coding with Poisson-like spike trains (Gilson et al.,2011). In other words, STDP prohibits spike-time coding when the rate ismodulated.

FIG. 1A illustrates a network 100 learning from spike inputs whilediscounting the influence of input spike rates, according to anembodiment of the present invention. The network 100 comprises at leastone pre-synaptic neuron 140, at least one post-synaptic neuron 150, andat least one synapse 111 with a synaptic efficacy. The network 100 canlearn temporal correlations in input spikes from the pre-synaptic neuronregardless of the input rates. Each input has timed spikes and may haveadded rate-dependent spikes. Some inputs are spike-timing-correlated.The learning rule component 110 outputs a set of aimed efficacies 116during operation of the network. The learning rule component 110comprises a spike-rate component 115. The spike-rate component 115computes the rates of the presynaptic input spikes. The learning rulecomponent 110 sets the aimed weights 116 according to the input spikesafter discounting the spike-rate component 115.

Learning a task from a spike-timing code consists of learning to detectthe temporal correlations across the streams that carry the code.Therefore, in a spiking neural network, the neurons need to learn toassociate the strongly covarying inputs. This can be well illustrated asthe discovery of clusters of high covariance in the covariance matrix ofthe input synapse pairs. In STDP, a neuron approximates this bypotentiating, i.e., strengthening, and depressing, i.e., weakening, thesynapses whose input shows high and low covariance respectively with theneuron's own output, i.e. the weighted sum of the input streams. This isa good approximation because inputs that covary strongly with theneuron's output which they drive likely covary strongly with oneanother. So covariances between inputs define the learning task, andcovariances of the inputs with the sum of all inputs set a predictionfor the neuron's optimal approximation to the task. The neuron's STDPsynapses compare the inputs directly with the neuron's output, so aneuron's learning is sensitive specifically to the uncentered covarianceof the inputs. If in addition to the covariances introduced by thecorrelations in timings of individual spikes there are covariancesintroduced by correlations in the rates of the inputs, theserate-induced covariances dominate the uncentered covariance, because ofthe spurious correlations of individual spike timings that are added bythe slower covarying rates. To detect the fast covariances in thepresence of the slow ones, individual spikes from high-rate channelsmust contribute less to the computed covariance than those from low-ratechannels.

FIG. 1B illustrates a network 101 configured with a learning rulecomponent 110 that outputs the aimed synaptic efficacies according toclusters found in the normalized covariance (normcov) matrix of theinputs, according to an embodiment of the present invention. In thisembodiment, the normcov component 117 of the learning rule component 110calculates the normcov. The output of the normcov component 117 is thenclustered by a clustering component 118, which modifies the at least onesynaptic efficacy component 111. The normcov component 117 computes thenormcov and sets the synaptic efficacies component 111 according toclusters found in the normalized covariance (normcov) matrix of theinputs. At least one pre-synaptic neuron 140 and at least onepost-synaptic neuron 150 are connected to the at least one synapticefficacy component 111. For a neuron to learn a spike-timing code in thepresence of a rate code, a modified STDP rule is needed. Ideally, thisrule should approximate the learning of covariances of rate-normalizedinputs via learning the covariances of rate-normalized inputs with theneuron's own output. In this disclosure, an STDP rule that includes acomponent that normalizes the postsynaptic contributions of eachpresynaptic spike by an increasing function of the recent rate isintroduced. The rule is referred to as fatiguing STDP (FSTDP), becausethe rate normalization component is a synaptic fatigue mechanism, whichcan be combined with STDP. FSTDP can be used to implement normcov-basedlearning. Therefore, FSTDP can be used to implement spike-timing-basedlearning that discounts rate-correlations.

Synaptic fatigue has some postsynaptic effects. In particular, itreduces the effect of pre-synaptic input on the post-synaptic membranepotential when the input rate increases. Synaptic fatigue can beimplemented in a spike-based manner by reducing the effect of apresynaptic spike on the postsynaptic neuron if the spike arrives soonafter a previous one. Synaptic fatigue can be represented as:ΔV _(mem) ^(sp) ^(n) =A·G(t),with G(t)=W(t)·[1−F(t)], and F(t)=ƒ(t−t^(sp) ^(n−1) )+F(t^(sp) ^(n−1) ),where A is a fixed term specific to a pair of pre- and post-synapticneurons. G(t) is the synaptic efficacy, W is the weight, ƒ(x) is adecreasing function. In many cases, G(t) tends to W in the long term.However, W is uninfluenced by the fatigue.

FIG. 1C illustrates a circuit 102 implementing an FSTDP neuron,according to an embodiment of the present invention. In this circuit, apresynaptic neuron 140 is connected to the synapse 110, which is in turnconnected to the post-synaptic neuron. The pre-synaptic neuron generatesa series of spikes. The spikes are received by the weight component 111of the synapse 110. The output of the weight component 111 is receivedat the input of the fatiguing component 112. The output of the fatiguingcomponent 112 is subtracted from the output of thespike-timing-dependent plasticity component 111 at the differencecomponent 113. The output of the difference component is received in themembrane potential 151 of the post-synaptic neuron. The synaptic fatiguecauses the synaptic efficacy to be given by G(t)=W(t)[1−F(t)], whereG(t) is the efficacy, W(t) denotes the stored synaptic weight, and F(t)is a function that depends on the time of arrival of the presynapticspikes. An STDP component 120 modifies the weight component 111according to the timing of spikes from the pre- and the post-synapticneurons. A fatigue rule component 121 modifies the fatiguing component112 according to the function F(t). In the absence of presynapticspikes, F tends to zero as t tends to infinity and thus, G(t)→W(t). F(t)can be implemented, in an embodiment, in a spike-based way as a functionthat increases its value by a fixed amount upon the arrival of apresynaptic spike and decays exponentially.

FIG. 2 illustrates a device 200 having an integrate-and-fire neuron 230with a set 210 or array of fatiguing plastic synapses 211-214 attached.The integrate-and-fire neuron 230 equipped with the set of plasticsynapses 211-214 and an FSTDP learning rule can be used to detecttemporal correlations between event-based data streams. Each eventarrives as a spike at a corresponding synapse 211-214, and apostsynaptic potential is generated and added to the membrane potentialof the neuron 230. The temporal correlations between the presynapticinput spikes and the neuronal firing events result in an evolution ofthe synaptic weights due to a feedback-driven competition among thesynapses 211-214. In the steady state, the correlations between theindividual input streams can be inferred from the distribution of thesynaptic weights or the resulting firing activity of the postsynapticneuron 230.

Synapses with FSTDP can be implemented as a combination of one ofvarious forms of fatigue with one of various forms of STDP. A rate-basedfatigue component can be implemented such that fatigue increases with anincrease in the pre-synaptic input rate R(t). For example, the fatiguecomponent may have the form:

${F(t)} = {1 - {\frac{1}{{R(t)} + 1}.}}$The rate may be calculated as a mean over a time window in the past.This mean calculation can weight recent inputs more than distant ones.

Alternatively, a spike-based fatigue component can be implemented. Inthis approach, the fatigue component is given as:F(t)=ƒ(t−t ^(sp) ^(n) )+F(t ^(sp) ^(n) ),where t^(sp) ^(n) is the timing of the last presynaptic spike prior tot. The function ƒ(x) can be implemented as:step:

${f(x)} = \left\{ {\begin{matrix}{a,} & {0 < x < x_{window}} \\{0,} & {x \geq x_{window}}\end{matrix},{a \in \left( {0,1} \right\rbrack},} \right.$linear:ƒ(x)=1−ax, a>0,exponential: ƒ(x)=1−ae ^(−x) , a>0, or any decreasing function.

The STDP component can be implemented using any of a number ofapproaches, such as a classic STDP, a triplet rule, a Fusi rule, orother alternatives.

FSTDP allows for low-rate correlation detection. The necessary andsufficient conditions for successful correlation detection with aspike-timing-based plasticity is that the expected value of the weightupdates for temporally correlated synapses be positive, i.e., that theprobability of their potentiation exceed the probability of theirdepression, and vice versa for uncorrelated synapses:E[ΔW _(cor)]>0,E[ΔW _(uncor)]<0.This necessitates that the probability that each presynaptic spike of asynapse will bring the postsynaptic neuron to its firing threshold Vth,causing a postsynaptic spike is higher for temporally correlated spikeinputs than for the other inputs, even if the latter include ratecorrelations. The use of fatigue in FSTDP increases the potential thatthe membrane needs to be at for a presynaptic spike to cause it to fire,and this effect of FSTDP is smaller on low-rate inputs than on high-rateinputs. Thus, with FSTDP the probability of firing and potentiation dueto an input spike from a “correlated” synapse is higher than that due tospikes from “uncorrelated” synapses, even if the latter have high,correlated rates.

The above described FSTDP circuits can be implemented as softwaresimulations or as hardware emulations. Neural networks with anyvariation of FSTDP can be simulated in software on a variety of hardwareplatforms, such as CPU based systems, including architectures such asSpiNNaker, GPU based systems, or AI accelerators with synaptic dynamics.Simulated learning that updates programmable synapses on neuromorphichardware can also be used as well as learning simulated off-chip, suchas in IBM TrueNorth or CxQuad. (Note: the term(s) “SpiNNaker,” “IBM,”“TrueNorth,” and/or “CxQuad” may be subject to trademark rights invarious jurisdictions throughout the world and are used here only inreference to the products or services properly denominated by the marksto the extent that such trademark rights may exist.)

FSTDP can also be emulated in neuromorphic hardware withspike-timing-dependent long-term dynamics and with depressing short-termsynaptic dynamics, either in analog complementary metal-oxidesemiconductor (CMOS) or digital CMOS. Phase change memory or resistivememory, combined with transistors, can implement STDP. The use of phasechange memory has been implemented for 2T-1R and for 1T-1R. A signalprovided to the gate of a transistor, as part of each pre-synapticspike, can shape the amount of current passing through, according to afatigue rule defined by additional synaptic circuitry on-chip, oroff-chip. Additionally, a non-volatile memory element with volatilecharacteristics that match a form of fatigue. Volatility used forshort-term fatiguing effect, non-volatility for long-term STDP weightupdate. conventionally, long-term plasticity is combined with certainforms of short-term plasticity (but not fatigue), emulated throughmatching volatile dynamics.

The present invention, in embodiments, may be a system, a method, and/ora computer program product at any possible technical detail level ofintegration. The computer program product may include a computerreadable storage medium (or media) having computer readable programinstructions thereon for causing a processor to carry out aspects of thepresent invention.

The computer readable storage medium can be a tangible device that canretain and store instructions for use by an instruction executiondevice. The computer readable storage medium may be, for example, but isnot limited to, an electronic storage device, a magnetic storage device,an optical storage device, an electromagnetic storage device, asemiconductor storage device, or any suitable combination of theforegoing. A non-exhaustive list of more specific examples of thecomputer readable storage medium includes the following: a portablecomputer diskette, a hard disk, a random access memory (RAM), aread-only memory (ROM), an erasable programmable read-only memory (EPROMor Flash memory), a static random access memory (SRAM), a portablecompact disc read-only memory (CD-ROM), a digital versatile disk (DVD),a memory stick, a floppy disk, a mechanically encoded device such aspunch-cards or raised structures in a groove having instructionsrecorded thereon, and any suitable combination of the foregoing. Acomputer readable storage medium, as used herein, is not to be construedas being transitory signals per se, such as radio waves or other freelypropagating electromagnetic waves, electromagnetic waves propagatingthrough a waveguide or other transmission media (e.g., light pulsespassing through a fiber-optic cable), or electrical signals transmittedthrough a wire.

Computer readable program instructions described herein can bedownloaded to respective computing/processing devices from a computerreadable storage medium or to an external computer or external storagedevice via a network, for example, the Internet, a local area network, awide area network and/or a wireless network. The network may comprisecopper transmission cables, optical transmission fibers, wirelesstransmission, routers, firewalls, switches, gateway computers and/oredge servers. A network adapter card or network interface in eachcomputing/processing device receives computer readable programinstructions from the network and forwards the computer readable programinstructions for storage in a computer readable storage medium withinthe respective computing/processing device.

Computer readable program instructions for carrying out operations ofthe present invention may be assembler instructions,instruction-set-architecture (ISA) instructions, machine instructions,machine dependent instructions, microcode, firmware instructions,state-setting data, configuration data for integrated circuitry, oreither source code or object code written in any combination of one ormore programming languages, including an object oriented programminglanguage such as Smalltalk, C++, or the like, and procedural programminglanguages, such as the “C” programming language or similar programminglanguages. The computer readable program instructions may executeentirely on the user's computer, partly on the user's computer, as astand-alone software package, partly on the user's computer and partlyon a remote computer or entirely on the remote computer or server. Inthe latter scenario, the remote computer may be connected to the user'scomputer through any type of network, including a local area network(LAN) or a wide area network (WAN), or the connection may be made to anexternal computer (for example, through the Internet using an InternetService Provider). In some embodiments, electronic circuitry including,for example, programmable logic circuitry, field-programmable gatearrays (FPGA), or programmable logic arrays (PLA) may execute thecomputer readable program instructions by utilizing state information ofthe computer readable program instructions to personalize the electroniccircuitry, in order to perform aspects of the present invention.

Aspects of the present invention are described herein with reference toflowchart illustrations and/or block diagrams of methods, apparatus(systems), and computer program products according to embodiments of thepresent invention. It will be understood that each block of theflowchart illustrations and/or block diagrams, and combinations ofblocks in the flowchart illustrations and/or block diagrams, can beimplemented by computer readable program instructions.

These computer readable program instructions may be provided to aprocessor of a general purpose computer, special purpose computer, orother programmable data processing apparatus to produce a machine, suchthat the instructions, which execute via the processor of the computeror other programmable data processing apparatus, create means forimplementing the functions/acts specified in the flowchart and/or blockdiagram block or blocks. These computer readable program instructionsmay also be stored in a computer readable storage medium that can directa computer, a programmable data processing apparatus, and/or otherdevices to function in a particular manner, such that the computerreadable storage medium having instructions stored therein comprises anarticle of manufacture including instructions which implement aspects ofthe function/act specified in the flowchart and/or block diagram blockor blocks.

The computer readable program instructions may also be loaded onto acomputer, other programmable data processing apparatus, or other deviceto cause a series of operational steps to be performed on the computer,other programmable apparatus or other device to produce a computerimplemented process, such that the instructions which execute on thecomputer, other programmable apparatus, or other device implement thefunctions/acts specified in the flowchart and/or block diagram block orblocks.

The flowchart and block diagrams in the Figures illustrate thearchitecture, functionality, and operation of possible implementationsof systems, methods, and computer program products according to variousembodiments of the present invention. In this regard, each block in theflowchart or block diagrams may represent a module, segment, or portionof instructions, which comprises one or more executable instructions forimplementing the specified logical function(s). In some alternativeimplementations, the functions noted in the blocks may occur out of theorder noted in the Figures. For example, two blocks shown in successionmay, in fact, be executed substantially concurrently, or the blocks maysometimes be executed in the reverse order, depending upon thefunctionality involved. It will also be noted that each block of theblock diagrams and/or flowchart illustration, and combinations of blocksin the block diagrams and/or flowchart illustration, can be implementedby special purpose hardware-based systems that perform the specifiedfunctions or acts or carry out combinations of special purpose hardwareand computer instructions.

While the foregoing written description of the present invention enablesone of ordinary skill to make and use what is considered presently to bethe best mode thereof, those of ordinary skill will understand andappreciate the existence of alternatives, adaptations, variations,combinations, and equivalents of the specific embodiment, method, andexamples herein. Those skilled in the art will appreciate that thewithin disclosures are exemplary only and that various modifications maybe made within the scope of the present invention. In addition, while aparticular feature of the teachings may have been disclosed with respectto only one of several implementations, such feature may be combinedwith one or more other features of the other implementations as may bedesired and advantageous for any given or particular function.Furthermore, to the extent that the terms “including”, “includes”,“having”, “has”, “with”, or variants thereof are used in either thedetailed description and the claims, such terms are intended to beinclusive in a manner similar to the term “comprising.”

Other embodiments of the teachings will be apparent to those skilled inthe art from consideration of the specification and practice of theteachings disclosed herein. The present invention should therefore notbe limited by the described embodiment, method, and examples, but by allembodiments and methods within the scope and spirit of the presentinvention. Accordingly, the present invention is not limited to thespecific embodiments as illustrated herein, but is only limited by thefollowing claims.

Some helpful definitions follow:

Present invention: should not be taken as an absolute indication thatthe subject matter described by the term “present invention” is coveredby either the claims as they are filed, or by the claims that mayeventually issue after patent prosecution; while the term “presentinvention” is used to help the reader to get a general feel for whichdisclosures herein that are believed as may be being new, thisunderstanding, as indicated by use of the term “present invention,” istentative and provisional and subject to change over the course ofpatent prosecution as relevant information is developed and as theclaims are potentially amended.

Embodiment: see definition of “present invention” above—similar cautionsapply to the term “embodiment.”

and/or: inclusive or; for example, A, B “and/or” C means that at leastone of A or B or C is true and applicable.

User/subscriber: includes, but is not necessarily limited to, thefollowing: (i) a single individual human; (ii) an artificialintelligence entity with sufficient intelligence to act as a user orsubscriber; and/or (iii) a group of related users or subscribers.

Module/Sub-Module: any set of hardware, firmware and/or software thatoperatively works to do some kind of function, without regard to whetherthe module is: (i) in a single local proximity; (ii) distributed over awide area; (iii) in a single proximity within a larger piece of softwarecode; (iv) located within a single piece of software code; (v) locatedin a single storage device, memory or medium; (vi) mechanicallyconnected; (vii) electrically connected; and/or (viii) connected in datacommunication.

Computer: any device with significant data processing and/or machinereadable instruction reading capabilities including, but not limited to:desktop computers, mainframe computers, laptop computers,field-programmable gate array (FPGA) based devices, smart phones,personal digital assistants (PDAs), body-mounted or inserted computers,embedded device style computers, application-specific integrated circuit(ASIC) based devices.

What is claimed is:
 1. An apparatus comprising: circuitry comprising: atleast one presynaptic artificial neuron configured to generate asequence of presynaptic spikes each having a respective timing; apostsynaptic artificial neuron comprising a membrane potential; and alearning rule circuit, comprising a synaptic efficacy and a synapticplasticity, configured to learn temporal correlations in the respectivetiming of the presynaptic spikes by discounting, using a rate-normalizedcovariance matrix, correlations in spike rates associated with thesequence of presynaptic spikes in the learning, the discountingincluding modifying the synaptic efficacy for clusters, in therate-normalized covariance matrix, having covarying frequency higherthan a preset threshold, using a learning rule based on a result of thelearning, wherein the learning rule is based on the respective timing.2. The apparatus of claim 1, wherein the learning rule circuit comprisesa normalized covariance circuit configured to generate therate-normalized covariance matrix based on the sequence of presynapticspikes and a clustering circuit configured to detect the clusters, of aportion of the sequence of presynaptic spikes, in the normalizedcovariance matrix and modify the synaptic efficacy according to thedetected clusters in the normalized covariance matrix.
 3. The apparatusof claim 1, wherein the learning rule circuit comprises at least oneplastic synapse that is operatively connected to the at least onepresynaptic artificial neuron to receive the sequence of presynapticspikes and modify the postsynaptic membrane potential based on thesynaptic efficacy and the sequence of presynaptic spikes.
 4. Theapparatus of claim 3, wherein the synaptic efficacy comprises aweighting and a rate normalization, and wherein the at least one plasticsynapse comprises a Hebbian plasticity circuit configured to modify theweighting, and a rate-dependent plasticity circuit configured to modifythe rate normalization.
 5. The apparatus of claim 4, wherein the Hebbianplasticity circuit comprises a spike-timing-dependent plasticity ruleand the rate-dependent plasticity circuit comprises a fatigue plasticityrule.
 6. The apparatus of claim 5, wherein the sequence of presynapticspikes is characterized by the spike rates, and the rate-dependentplasticity circuit, using a fatigue plasticity rule, is configured tomodify the rate normalization so that each spike in the sequence ofpresynaptic spikes has a reduced effect on the membrane potential whenthe spike rates increase.
 7. The apparatus of claim 3, wherein the atleast one plastic synapse comprises a non-volatile memory element withvolatile characteristics that match a form of fatigue.
 8. The apparatusof claim 3, wherein the at least one plastic synapse comprises a digitalcomplementary metal-oxide semiconductor (CMOS) circuit or an analog CMOScircuit.
 9. An apparatus comprising: circuitry comprising: at least onepresynaptic artificial neuron configured to generate a sequence ofpresynaptic spikes; and a postsynaptic artificial neuron comprising: apostsynaptic membrane potential; and at least one fatiguing plasticsynapse connected to the postsynaptic membrane potential, wherein thepostsynaptic artificial neuron is configured to receive a set of thesequence of presynaptic spikes, and the at least one fatiguing plasticsynapse comprises: an efficacy having a spike-timing-dependentplasticity; and a fatiguing circuit, wherein the fatiguing circuit isconfigured to reduce the efficacy using a fatigue plasticity rule basedon the set of the sequence of presynaptic spikes, by discounting, usinga rate-normalized covariance matrix, correlations in a spike rateassociated with the sequence of presynaptic spikes in a learning oftemporal correlations in a respective timing of the presynaptic spikes,the discounting including modifying the efficacy for clusters, in therate-normalized covariance matrix, having covarying frequency higherthan a preset threshold, based on a result of the learning.
 10. Theapparatus of claim 9, wherein the sequence of presynaptic spikes ischaracterized by the spike rate, and the fatiguing circuit, using thefatigue plasticity rule, is configured to modify the efficacy so thateach spike in the sequence of presynaptic spikes has a reduced effect onthe postsynaptic membrane potential when the spike rate increases. 11.The apparatus of claim 9, wherein the fatigue plasticity rule comprises:a function:F(t)=1−1/(R(t)+1), wherein R(t) is a spike rate at time t.
 12. Theapparatus of claim 11, wherein the spike rate comprises a mean over atime window.
 13. The apparatus of claim 9, wherein the fatiguingplasticity rule comprises: a function:F(t)=ƒ(t−t ^(sp) ^(n) )+F(t ^(sp) ^(n) ), wherein f(x) is a decreasingfunction, t^(sp) ^(n) is a timing of the last presynaptic spike prior tot, F(t) is a function that depends on a time of arrival of thepresynaptic spikes.
 14. The apparatus of claim 13, wherein:${f(x)} = \left\{ {\begin{matrix}{a,} & {0 < x < x_{window}} \\{0,} & {x \geq x_{window}}\end{matrix},{a \in {\left( {0,1} \right\rbrack.}}} \right.$
 15. Theapparatus of claim 13, wherein:ƒ(x)=1−ax, a>0 or ƒ(x)=1−ae ^(−x) , a>0.
 16. The apparatus of claim 9,wherein the plastic synapse comprises a transistor having a gate input,wherein the set of presynaptic spikes are input into the gate so as tomodify an amount of current passing through the transistor.
 17. Theapparatus of claim 16, wherein the modifying the amount of currentpassing through the transistor is in accordance with a fatigue rule. 18.The apparatus of claim 9, wherein the plastic synapse comprises anon-volatile memory element with volatile characteristics that match aform of fatigue.
 19. The apparatus of claim 9, wherein the plasticsynapse comprises a digital complementary metal-oxide semiconductor(CMOS) circuit or an analog CMOS circuit.
 20. An apparatus comprising:circuitry comprising: at least one presynaptic artificial neuronconfigured to generate a sequence of presynaptic spikes; a postsynapticartificial neuron comprising: a postsynaptic membrane potential; and atleast one plastic synapse circuit configured to receive a set of thesequence of presynaptic spikes; and a learning rule circuit configuredto: learn temporal correlations in a respective timing of thepresynaptic spikes by discounting, using a rate-normalized covariancematrix, correlations in spike rates associated with the sequence ofpresynaptic spikes in the learning, the discounting including modifyingthe postsynaptic membrane potential for clusters, in the rate-normalizedcovariance matrix, having covarying frequency higher than a presetthreshold, using a result of learning based on a normalized covarianceof the sequence of presynaptic spikes.